METHODS AND ARRANGEMENTS FOR 
ORDERING CHANGES IN COMPUTING SYSTEMS 

Field of the Invention 

The present invention relates to distributed computing systems and, more particularly, 
5 to arrangements and methods for automatically determining allowable sequences of changes, 
e.g., sequences where the order in which changes are carried out will transition a computing 
system from a workable state into another workable state. 

Background of the Invention 

Implementing changes for hardware, software, network and storage systems in large- 
10 scale eBusiness environments remains painful to customers: Rolling out changes, such as 
(un)installing, upgrading or configuring systems can take weeks, partly because the complex 
interdependencies between applications and their supporting services are not made explicit, thus 
requiring human involvement and expertise. Solving this change management problem 
automatically is important to address the increasing complexity of computing systems. The 
1 5 number of relationships of a single managed resource (a software artifact, a network 

component, a storage system) range from 10 to 100; the number of managed resource instance 
relationships in large-scale enterprise systems is often between 1,000,000 and 1,000,000,000. 



YOR920030547US1 



- l - 



Given that a change to one or more managed resources may entail additional changes on a 
multitude of other managed resources, it is evident that the need for human involvement in the 
change management process needs to be minimized. This motivates the need for a generic 
approach that discovers allowable sequences of changes by interacting with the target systems. 

5 The identification and tracking of relationships between the components of distributed 

systems is becoming increasingly important for change management. Software artifacts and 
their components rely on a variety of supporting artifacts. Consequently, applying a change to 
one artifact affects other artifacts, i.e., artifacts have dependencies on other artifacts. They 
exist between the components of different artifacts on a single system and also between the 

10 artifacts on multiple systems and organizational domains. Artifacts that depend on others are 
referred to as dependents, while artifacts on which other artifacts depend are referred to as 
antecedents. It is important to note that an artifact often plays both roles (e.g., a name service 
is required by many applications and services but is depending itself on the proper functioning 
of other services, such as the operating system and the network protocols and infrastructure), 

1 5 thus leading to a dependency hierarchy that can be modeled as a directed acyclic graph (DAG). 
Furthermore, dependency relationships are transitive, i.e., the dependent of a given component 
requires, in addition to the component itself, also the components' antecedent(s). Dependencies 
exist between various artifacts of a distributed system, such as end-user services, system 
services, applications and their logical and physical components. 
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Prior art in the area of software development [US4751635], [US5960196] and 
maintenance [US5493682] deals with individual software elements and modules that form the 
atomic parts of a program package and require the availability of program source code in order 
to build software and bundle it into software products. Source code is available to the software 
5 developer and not to the service user. 

Prior art in the area of software packaging [US5835777] deals with individual software 
elements and modules that form the atomic parts of a program package and require the 
availability of program source code in order to build software and bundle it into software 
products. 

10 [IEEE 1387.2 1995] addresses software distribution/deployment/installation. It 

defines a mechanism for ensuring that new software components (which are going to be 
installed) do not conflict with an already existing software installation. It identifies three kinds 
of relationships prerequisite, exrequisite, corequisite that facilitate such compatibility checks. 
This is done individually for every system on which new software needs to be installed. The 

1 5 software inventories present on other systems are not taken into account. Furthermore, this 

IEEE specification does not deal with instantiated applications and services and therefore does 
not represent any means of determining the dependencies between components at runtime. 
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[OpenGroup 1998] extends [IEEE 1387.2 1995] by defining several commands 
(swinstall, swlist, swmodify, etc.) that are invoked by software installation tools on a specific 
system. It also defines a software definition file format to make sure that the information 
required by the aforementioned commands is available from the system on which the 
5 commands are invoked. The shortcomings of [IEEE 1387.2 1995] (confined to a single 

isolated system, with no means for determining software dependencies at runtime) also apply to 
this specification. 

Current Operating System Inventory implementations (such as the IBM AIX 
Object Data Manager (ODM), the Linux Red Hat Package Manager (RPM) or the 
10 Microsoft Windows Registry) follow either [OpenGroup 1998] and [IEEE 1387.2 1995] 
or describe the software inventory in a proprietary format. Thus, the aforementioned 
limitations also apply to them. 

Techniques for electronic software distribution of whole program packages 
[US6009525] [US5721824] or updates/coirections/fixes/patches [US5999740] [US5805891] 
15 [US5953533] are, by definition, restricted to the distribution/deployment/installation of (one 
or many at a time) physical software packages and do not take the runtime stages of 
applications into account. In addition, they deal with one system at a time and do not take the 
cross-system aspects of applications and services into account. 



YOR920030547US1 



-4- 



Techniques for determining conflicts in existing software/hardware configurations 
[US58677 14] are also confined to a single system and do not take runtime aspects into account. 

There thus exists a need to describe a generic approach to discover allowable 
sequences of changes of artifacts, which prior art (such as the U.S. Patent Application Serial 
5 No. 09/755,786, filed on January 5, 2001, and entided "Systems and Methods for Service- 
and Rolebased Software Distribution") does not take into account. There is also a further need 
to determine dependency relationships in distributed systems (as disclosed in U.S. Patent 
Application Serial No. 10/241,162, filed on September 1 1, 2002, and entided "Methods and 
Apparatus for Managing Dependencies in Distributed Systems"), and transforms such 
10 acquired relationships into task sequences that are linked by temporal ordering constraints. 

Examples of such constraints are: "Task X must finish before task Y can begin, Task X cannot 
start until task Y does, Task X cannot finish before task Y does, Task X cannot finish until 
task Y starts". These constraints apply to various types of change tasks, such as install, 
uninstall, configure, start, stop. 

15 Summary of the Invention 

The present invention broadly directed to automatically determining allowable 
sequences of changes, i.e., the order in which changes are carried out will transition the target 
systems from a workable state into another workable state. 
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In summary, one aspect of the invention provides a method of determining an 
allowable order of changes in a distributed system, the method comprising the steps of 
determining existing relationship descriptions between components of the system; 
transforming acquired relationships into ordered tasks that are linked by temporal ordering 
constraints; and creating an order of changes taking into account task relationship constraints. 

Another aspect of the present invention provides a system for determining an 
allowable order of changes in a distributed system, the system comprising an arrangement for 
determining existing relationship descriptions between components of the system; an 
arrangement for transforming acquired relationships into ordered tasks that are linked by 
temporal ordering constraints; and an arrangement for creating an order of changes taking into 
account task relationship constraints. 

An additional aspect of the present invention provides a program storage device 
readable by machine, tangibly embodying a program of instructions executable by the 
machine to perform method steps for determining an allowable order of changes in a 
distributed system, said method comprising the steps of determining existing relationship 
descriptions between components of the system; transforming acquired relationships into 
ordered tasks that are linked by temporal ordering constraints; and creating an order of changes 
taking into account task relationship constraints. 
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For a better understanding of the present invention, together with other and further 
features and advantages thereof, reference is made to the following description, taken in 
conjunction with the accompanying drawings, and the scope of the invention will be 
pointed out in the appended claims. 

Brief Description of the Drawings 

Figure 1 is a block diagram illustrating the data flows between the various components 
involved in a system for ordering changes according to an embodiment of the present invention. 

Figure 2 is a block diagram illustrating the topology as well as the dependency 
relationships of an eCommerce Application System according to an embodiment of the present 
invention. 

Figure 3 is a block diagram illustrating the task relationships and relationship 
constraints according to an embodiment of the present invention. 

Figure 4 is a block diagram illustrating the task graph as a result of the task 
consolidation step according to an embodiment of the present invention. 

Figure 5 is a block diagram illustrating the annotated task graph as a result of the task 
annotation step according to an embodiment of the present invention. 
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Figure 6 is a flow diagram illustrating steps of actions for determining the order of 
Changes and Task Durations according to an embodiment of the present invention. 

Figure 7 is a flow diagram illustrating steps of actions for the construction of a Task 
Graph from Dependency Information according to an embodiment of the present invention. 

5 Figure 8 depicts examples of Task Graph Builder APIs according to an embodiment of 

the present invention. 

Description of the Preferred Embodiments 

Several other copending and commonly owned U.S. patent applications, filed 
concurrently herewith, disclose various processes and arrangements whose details may, in 

10 the role of background information, help provide a better understanding of one or more of 
the embodiments disclosed and contemplated herein. Accordingly, those applications are 
hereby fully incorporated by reference as if set forth in their entirety herein, and are as 
follows (including the title and attorney docket number for each one): "Methods And 
Arrangements for Automated Change Plan Construction and Impact Analysis" (Docket 

15 No. YOR920030548US 1); and "Methods and Arrangements for Planning and Scheduling 
Change Management Requests in Computing Systems" (Docket No. 
YOR920030549US1). 
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The change management process starts with the submission of a Request For Change 
(RFC), which is viewed as a job in scheduling terms. Many RFCs may be submitted 
concurrently. The RFC describes what is to be done, usually in terms of hardware/software 
artifacts to change (deploy, install, configure, uninstall), as well as the deadline by which the 
5 change needs to be completed. Examples include changing the schema of a database table in a 
running application and installing a new release of a web application server in a multi-tiered 
eCommerce system. An important observation is that many changes are not explicitly 
included in the RFC. Rather, they are merely implied. For example, applications must be 
recompiled if they use a database table whose schema is to change. Such implicit changes are a 
10 result of various kinds of relationships, such as service dependencies and resource sharing. 

An RFC preferably contains the name of the artifact(s) that need to be changed, the 
name(s) of the target system(s) and the requested operation (e.g., "update the orderDisplay 
and buyConfirmation servlets, as well as the credit card transactions - CC XACTS - database 
table"). In addition, the RFC contains the deadline (time/date) by which the change must be 
1 5 completed, as well as its maximum allowable duration (e.g., a maintenance interval with a 
length of 2 hours, ending at Sam). Note that an RFC is declarative by stating what needs to 
be accomplished, but leaves the procedural details (i.e., how the change is carried out) open. 
Based on the submitted RFC, the system disclosed in the present invention, the Task Graph 
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Builder, determines the allowable order of the tasks that are necessary to fulfill the RFC. To 
do so, it may exploit two sources of dependency information: 

1. The first source comprises Deployment Descriptors that annotate software 
packages, which reside in a software repository or on a software installation server. 

5 Deployment descriptors (such as the ones used by Linux RPM or AIX installp packages) 

provide meta-information, gathered at build time (and preferably automatically generated by 
the development tools), about a software package, such as identifying and version data, and 
dependency information. This dependency information lists the pre-requisites (packages that 
must be present on the system for an installation to succeed), the co-requisites (packages that 
10 must be jointly installed) as well as ex-requisites (packages that must be removed prior to 
installing a new package). 

2. In addition to the dependency information available at build time, one needs to 
consider runtime dependency information that may vary over time and with the workload the 
system is subject to. In contrast to the dependency information captured in deployment 

15 descriptors, a Runtime Dependency Model captures dependencies that typically cross system 
boundaries. 

However, one having skill in the relevant art will recognize that modifications in the 
way how dependencies are obtained by the task graph builder, as well as their representation, 
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may be made without departing from the spirit and scope of the present invention. In particular, 
additional sources of dependency information may be available. With this information, 
containing the actual dependencies between the artifacts of a distributed system, the Task Graph 
Builder is able to determine the steps of a change as well as the order in which they have to be 
5 carried out. A representation of such information is called a Task Graph. An Annotated Task 
Graph comprises a task graph as well as time estimates for every task within the task graph; 
these estimates may have been obtained from previous deployments. Information stored 
within a task graph is specific to a given combination of artifacts, and may be decoupled from 
the target systems and their characteristics (e.g., CPU speed, RAM, free/available disk space). 

1 0 The purpose of the Task Graph Builder is to create reusable Task Graphs for various 

change management operations from existing dependency descriptions. It was noted above 
that task graphs describe the order in which tasks need to be carried out to transition a system 
from a workable state into another workable state. The order described may be the complete 
or total order, or something less, in which case the order described would be a partial order. It 

15 is presently preferred that the order described be a partial order. In order to achieve this, a task 
graph may contain information about: 

• The change management operation that needs to be carried out, e.g., install, 
update, configure, uninstall, 
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• the roles and names of the artifacts that are subject to a change (either directly 
specified in the RFC, or determined by the task graph builder), 

• the temporal and location constraints that may exist between tasks, based on artifact 
dependency information, 

• an estimate of how long every task is likely to take, based on the results of several 
previous deployments. This is needed to estimate the impact of a change in terms of 
downtime. 

In a specific embodiment of the present invention, a task graph may contain - in 
addition to the aforementioned data - information that relates to the specific hardware 
characteristics of a target system (such as CPU speed, RAM or total/available disk space) or 
names and IP addresses of target systems. It should be recognized, however, that 
modifications in the way of what data is contained within task graphs, as well as their 
representation, may be made without departing from the spirit and scope of the present 
invention 

The invention adds architectural elements to a change management system (such as the 
concurrently-filed U.S. patent application respectively identified as: attorney docket no. 
YOR920030548US1 entitled "Arrangements and Methods for Automated Change Plan 
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Construction and Impact Analysis") that enable it to initiate a change, trigger the acquisition of 
dependency relationship information along with temporal constraints, and its subsequent 
automated processing into change sequences. In order to achieve maximum efficiency of the 
change management process, this invention determines in which order changes need to be 
5 carried out to transition the target systems from a workable state into another workable state. In 
addition, the present invention determines whether changes are conflicting, and flags such 
potential violations to avoid breaking a system. The output of the invention can be consumed 
and modified by applications, comprising planning tools, schedulers, workflow editors, 
workflow management engines and automated provisioning systems for data centers, or by 
10 enterprise software distribution and configuration tools. 

In addition, the present invention is able to automatically refine an incoming request 
for change by breaking it down into atomic tasks. 

The present invention takes operational policies into account that define best practices. 
Examples of such policies are: "A application server must always be installed on a different 
1 5 system as a database server", "A specific version of a database management system must be 
present for an application server to function properly", "A servlet must be first quiesced before 
it can be upgraded", "On a clustered computing system, all application servers must have the 
same version and release levels". 



YOR920030547US1 



- 13- 



Finally, the invention leverages state models to determine which state transitions are 
allowed, such as the one defined in the CM Application Model. Examples of such state 
transitions are: "from state installable to state executable", but not "from state installable to 
state running". The method described in the present invention consists in reading the acquired 
5 relationship descriptions along with the temporal ordering constraints and combining them 
into a change task sequence. In particular, the method described here uses the relationship 
descriptions to determine whether change tasks must be carried out sequentially, or whether 
some (or all) of them can be carried out in parallel. It should be noted that the applicability 
of this invention is not confined to the problem area of software distribution, but can be used 
10 for all sorts of changes, such as (re)configuration of computing and software systems. 

Referring now to Figure 1 an architecture as well as the data flows between the various 
components involved in a system for ordering changes according to an embodiment of the 
present invention is depicted. It is assumed that Managed Resources (160) are able to 
provide descriptions of their system inventory, configuration files and their various 
1 5 dependencies (however, it should be noted that any data description format suits the purpose 
of the invention as well). The details on how this information can be acquired are as follows: 

• The most straightforward way is to provide appropriate instrumentation within the 
system and its applications and services; this information would then be exposed by means of 
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Descriptors, e.g., as flat XML files (145) and made available to the other components of the 
system through a Web Server (135). 

• Alternatively, the Dependency Service (125) makes use of information stored in 
System Repositories (150) for generating appropriate service dependency information. This 

5 information would then be made available to the other components of the system through a 
Webserver (135). 

• Third, the Managed Resources (160) could expose their information by means of 
an instrumentation agent, called Common Information Model (CM) Provider (155), which 
interacts with a CIM Object Manager (CIMOM) (140), as proposed by the Distributed 

10 Management Task Force (DMTF). The CIMOM would then expose the necessary information 
to the interested components. 

In the center of the figure various management services are depicted. These are: a 
VMResolver Service (115), a Task and Job Duration Estimator Service (120), and the 
Dependency Service (125). 

1 5 The change management process starts with the submission of a Request for 

Change (RFC) (105) to the Task Graph Builder (110) by the administrator (100). The 
process of creating the task graph, executed by the Task Graph Builder, comprises the 
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following steps: 

• Mapping the logical target names to a list of physical target systems, if needed. 

• Collecting the required dependency information, either directly from managed 
resources (160), or from an intermediate data store. 

5 • Estimating the duration for each task in the task graph as well as the overall duration 

(the makespan) of the job represented by the task graph, if needed. 

• Creating an Annotated Task Graph by attaching task and job duration information, 
if needed. 

• Delivering the (Annotated) Task Graph back to the Administrator. These steps are 
10 detailed below. 

The RFC may, in addition to the artifact name and various other information described 
above, either explicitly identify the target system(s), refer to them via their role (e.g., Database 
Server, Web Application Server), or provide a logical name (or alias) for the targets. A 
common example for the latter, used e.g., in on demand environments, is "database server 
15 cluster on which customer X's data is hosted". This logical name maps to a set of physical 
target systems. If logical names are used, the VMResolver (115) needs to bind given logical 
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names to physical target systems at the time when the RFC is executed (vs. at the time when 
it is defined). In the software engineering literature, this technique is termed "late binding". 

Once the artifact(s) that are subject to a change, the change management operation, and 
the target system names (or their roles) are determined, the Dependency Service (125) is 
invoked by the Task Graph Builder (110). The main tasks of the Dependency Service (125) 
are as follows: 

• Expose a 'drill-down' method that, upon receiving the identifier of a service, 
returns: (1) either descriptions of its direct antecedents, i.e., the first level below the node 
representing the service, (2) the whole subgraph below the node representing the service, or 
(3) an arbitrary subset of the dependency graph (levels m to n below a given node). 

• Provide a 'drill-up' method with the same facilities, targeting the dependents of 
the service. 

• Additional methods for gathering and filtering information for classes and 
properties of managed objects are present. 

• Obtaining the dependency information from the Managed Resources (160) by 
issuing queries over http and applying filtering rules (as specified by the administrator (100) or 
the task graph builder (110)) to it. 
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• Combining the information into a data structure that is sent back to the 
management system as document. 

The Dependency Service (125) processes the request, gathers all required dependencies 
and sends the results back to the Task Graph Builder (110), which - in turn - computes the 
5 Task Graph (165). Then, the Task Graph Builder collects - for every task within the task graph 
- the estimated task duration as well as the duration of the overall job represented by the task 
graph. It does this by querying the Task & Job Duration Estimator (120). Once the durations 
for the tasks and the overall job are determined, it annotates the task graph (165) with this 
information. Finally, the Task Graph Builder (110) delivers the Task Graph (165) to the 
1 0 Administrator (100). 

Referring now to Figure 2 the topology as well as the dependency relationships of an 
eCommerce Application System, according to an embodiment of the present invention is 
depicted. Such a relationship model focuses on software artifacts and their logical (modules, 
components) and physical (files, shared libraries) architecture. It captures the detailed 
1 5 descriptions of SW components, i.e., the system inventory, which is usually recorded in the 

various system repositories or in well-defined places e.g., the configuration files of a Managed 
Resource (160). Examples of system repositories include, but are not limited to the IBM ADC 
Object Data Manager (ODM), the Linux Red Hat Package Manager (RPM) or the Microsoft 
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Windows Registry. Information relating to software components is typically captured during 
the installation and deployment of a software package. In addition, the relationship model 
contains the dependencies between the various system components, depicted as arrows. For the 
sake of clarity, the names of the artifact types are written in normal typeface while the names 
5 of the products implementing them structural model are written in italic in Figure 2 

Host system "X" (265) plays the role of a Web Application Server and hosts the 
following components: The E-business Application, which is preferably implemented by a 
total of 14 Servlets (200, 205, 210). The latter encapsulate the business logic of the 
application. The Servlet Container (240) is preferably implemented by IBM WebSphere 
10 Application Server (WAS) Servlet Container. The Operating System (OS) is preferably 
Linux version 7.2 (245). 

Host system "Y" (270) plays the role of a Database Server and hosts the following 
components: 10 Database tables (235, 250) that hold the data accessed by the Servlets (200, 
205, 210). The database tables reside within a Database preferably implemented by (IBM) 
15 DB2 Universal Database (UDB) version 8. 1 (255), and an Operating System (OS), here 
preferably (IBM) Advanced Interactive Executive (AK) version 5.1 (260). 

It is assumed that the RFC (105) submitted to the Task Graph Builder (110) specifies 
that two Servlets, Bestsellers (bestsell) (205) and OrderDisplay (ordrdisp) (210) need to be 
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installed on Host System "X" (265). It is further assumed that the Operating System (245) is 
already installed on Host System "X" (265); in addition, it is assumed the presence of an 
Operating System (260) on Host System "Y" (270). Finally, we assume that the change 
management system is subject to an operational policy, stating that a Servlet Container must be 
5 installed on a different system than a Database. 

The invocation of the Dependency Service for this RFC yields the following 
dependencies (depicted in FIG. 2 by dashed lines): The bestsell Servlet (205) depends only on 
the Servlet Container (240) on Host System "X" (265). This dependency is illustrated by the 
arrow labeled (215). The ordrdisp Servlet (210) - in contrast - depends on both the Servlet 

1 0 Container (240) on Host System "X" (265) as well as on the Credit Card Transaction Table 
(CC_XACTS) (235) on Host System "Y" (270). The former dependency is illustrated by the 
arrow labeled (225); the latter by the arrow labeled (230). Determining the allowable partial 
order in which the following artifacts can be installed on two systems is of particular interest: 
bestsell Servlet (205), ordrdisp Servlet (210), Servlet Container (240), CC_XACTS Table 

15 (235), Database (255). 

Referring now to Figure 3, task relationships and relationship constraints, according to 
an embodiment of the present invention are depicted. Typically, dependency information is 
available in units that specify for every artifact which other artifacts it requires. Such 



YOR920030547US1 



-20- 



information is directly available, either from the descriptors of individual software packages (that 
list the pre-, co-, and ex-requisites of an artifact), or from system repositories. The system 
disclosed in U.S. Patent Application Serial No. 10/241,162, filed on September 11, 2002, and 
entitled "Methods and Apparatus for managing dependencies in distributed systems" provides 
5 a way to consolidate these atomic units of dependency information into a dependency graph 
that may span multiple systems by traversing, step by step, descriptors and/or repositories. The 
core component is a Dependency Service (125) that provides an API to execute operations for 
(recursively) traversing a dependency graph from the top to the bottom (drill^down), or in the 
opposite direction (drill- up). In the present embodiment, the Task Graph Builder (110) 
10 exploits this implementation by invoking the Dependency Service (125) and evaluating the 
returned dependency graph to determine whether tasks implied by a change must be carried 
out sequentially, or whether some of them can be carried out concurrently. 

Different change management operations require different traversals through the 
dependency models: A request for a new installation of a software artifact leads the Task Graph 
1 5 Builder (110) to invoke a recursive drill-down operation on the Dependency Query Facility to 
determine which artifacts must already be present before a new artifact can be installed. On the 
other hand, a request for an update, or an uninstall of an artifact leads to the invocation of a 
recursive drill-up query to determine the artifacts that will be impacted by the change. The 
present embodiment preferably uses four of temporal constraint types: 
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• Finish-to-Start (FS): This temporal constraint expresses that task A must finish 
before task B can begin and is the default constraint in workflow management systems. An 
example in a TPC-W eCommerce context is that a servlet container must be running (i.e., the 
task of starting it must be completed) before a new servlet can be deployed to it. 

5 • Start-to-Start (SS): Task B cannot start until task A does. An example for this 

constraint type are nested transactions and units of work. 

• Finish-to-Finish (FF): Task B cannot finish before task A does. Example: One 
cannot shutdown a system if the web application server is still running. 

• Start-to-Finish (SF): Task B cannot finish until task A starts. Example: a failover 
10 server cannot be taken offline before the main server is up again. Note that there is a subtle 

difference between this constraint type and the aforementioned FS constraint type, because here 
the start of a task determines the end of its predecessor (in the simpler FS case, the start of a task 
depends on the ending of its predecessor). 

The evaluation of the RFC (105) by the Task Graph Builder (110) stating that an 
15 installation change management operation needs to be carried out, and the consideration of 
the results of the relationship traversal, carried out by the Dependency Service (125), yield that 
the following tasks are subject to relationship constraints: 
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• The "Install Servlet Container on Host System "X"" task (300) must be finished 
("FS" type relationship constraint) (325) before the "Install bestsell Servlet on Host System 
"X"" task (315) can be started. - 

• The "Install Servlet Container on Host System "X"" task (300) must be finished 
5 ("FS" type relationship constraint) (325) before the "Install ordrdisp Servlet on Host System 

"X"" task (320) can be started. 

• The "Install CC_XACTS Table on Host System "Y"" task (305) must be finished 
("FS" type relationship constraint) (325) before the "Install ordrdisp Servlet on Host System 
"X"" task (320) can be started. 

10 • The "Install Database on Host System "Y"" task (310) must be finished ("FS" 

type relationship constraint) (325) before the "Install CC_XACTS Table on Host System "Y"" 
task (305) can be started. 

With this information, the Task Graph Builder (110) can proceed with consolidating the tasks. 

Referring now to Figure 4, a task graph for installing the bestsell and ordrdisp servlets 
15 of the Internet storefront application, according to an embodiment of the present invention is 
depicted. The existence of a dependency between two artifacts indicates that a relationship 
(FS, SS, FF, SF) constraint or a location constraint (e.g., a policy forbidding collocation of 
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Database and Servlet Container) exists between the dependent and the antecedent. The Task 
Graph Builder preferably observes the following rules: 

• Any task may have zero or more incoming and outgoing links. 

• If - within a set of task relationships - a task is the predecessor of several other 
different tasks, one instance of this task is chosen and for every succeeding task an outgoing link 
is attached to this task. 

• If - within a set of task relationships - a task is succeeding several other different 
tasks, one instance of this task is chosen and for every preceding task an incoming link is 
attached to this task. 

• If a relationship constraint exists between two tasks, they need to be carried out 
within a sequence. 

• If two tasks share the same predecessor and no temporal constraints exist between 
them, they can be executed concurrently within a flow. 

• The container for grouping tasks and their constraints on a per-host basis is a 
sequence. 
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• The outermost container for grouping per-host sequences is a process. 

By following the above rules, the Task Graph Builder (110) is able to consolidate the 
Task Relationships depicted in Figure 3 into the following Task Graph (165) consisting of two 
sequences that are grouped on a per-host basis and aggregated into a process. 

5 • The "Host X sequence" consists of the following tasks and links: The "Install 

Servlet Container on Host System "X" task (400) has two outgoing "FS"-type links (420, 420) 
pointing to the "Install bestsell Servlet on Host System "X"" task (435) and "Install ordrdisp 
Servlet on Host System "X"" task (440), respectively. 

• The "Host Y sequence" consists of the following tasks and links: The "Install 

10 Database on Host System "Y"" task (405) has one outgoing "FS"-type link (410) pointing to 
the "Install CC.XACTS Table on Host System "Y"" task (415). 

• Finally, one link (430) crosses the two per-host sequences, because the The "Install 
CC_XACTS Table on Host System "Y"" task (415) must be finished ("FS" type relationship 
constraint) (430) before the "Install ordrdisp Servlet on Host System "X"" task (440) can be 

15 started. 

The time axis (445) is used to illustrate the order in which these tasks need to be carried out. 
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Once the Task Graph is built, the Task Graph Builder can proceed with the last step, 
namely constructing the Annotated Task Graph by assigning the estimated durations to every 
task within the task graph and computing the makespan for the overall job represented by the 
Task Graph. 

5 Referring now to Figure 5 , the annotated task graph as a result of the task annotation 

step is depicted, according to an embodiment of the invention. It is generated by the Task 
Graph Builder (110) by invoking the Task & Job Duration Estimator (120) for every 
individual task, annotating the Task Graph (165) with this data, and computing the makespan 
for the overall job (and annotating the process with this data) represented by the Task Graph. 

1 0 The results are as follows: 

• The "Install Servlet Container on Host System "X" task (500) has a duration of 40 
minutes (510). 

• The "Install bestsell Servlet on Host System "X"" task (535) has a duration of 4.5 
minutes (550). 

15 • The "Install ordrdisp Servlet on Host System "X"" task (525) has a duration of 4 

minutes (540). 
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• The "Install Database on Host System "Y"" task (515) has a duration of 45 
minutes (520). 

• The "Install CC_XACTS Table on Host System "Y"" task (505) has a duration of 
15 minutes (530). 

5 Consequently, the makespan for the overall process is 1 hour and 4 minutes. This annotated 
task graph (165) is then returned to the administrator (100). 

It should be noted the, as mentioned above, the involved systems are referred to by their 
role (Application Server, Database Server) instead of their name. This facilitates the 
applicability of the same task graph to multiple systems in case multiple systems playing the 
10 same role are needed to fulfill a request for change (105). 

Referring now to Figure 6, a flow diagram illustrates steps of actions for determining 
the order of changes and task durations, according to an embodiment of the present invention. 
The algorithm begins at block (600) and proceeds as follows: Upon receipt of a new RFC, the 
Task Graph Builder extracts the relevant parameters from the RFC (605): Examples of such 
15 parameters include, but are not limited to: The name of the artifact that needs to be changed, 
and the target system names, and the change management operation. The Task Graph Builder 
then proceeds with mapping the logical target names to a list of physical target systems. This is 
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done by invoking the VMResolver component (115) from the Task Graph Builder (110), and 
by retrieving the results (610). Different procedures need to be applied according to the type of 
change management operation specified in the RFC: For example, an RFC may specify an 
INSTALL (615), an UPDATE (620), or a CONFIGURE (625) change management 
5 operation. One having skill in the relevant art will recognize that modifications and extensions 
of the change management operations may be made without departing from the spirit and 
scope of the invention. In the first case, the task graph builder would invoke a recursive drill- 
down operation (630) on the dependency service (125), The latter would return a list of 
artifacts that would need to be installed as well. 

10 In the second case, the task graph builder would invoke a recursive drill-up operation 

(635) on the dependency service (125) to retrieve a list of artifacts that would actually be 
impacted by the UPDATE change management operation. 

In the third case, the task graph builder would invoke a recursive drill-up operation 
(645) on the dependency service (125) to retrieve a list of artifacts that would actually be 
1 5 impacted by the CONFIGURE change management operation. 

After these steps have been performed and depending on what change management 
operation has been specified in the RFC, the task graph builder would create in each case the 
task relationships as well as the relationship constraints (645) from the data returned by the 
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dependency service. Then the task graph builder performs the task consolidation step (650) to 
generate a Task Graph, which needs to be subsequently annotated with the task and job 
duration estimates. To do so, the task graph builder invokes the task and job duration 
estimator (120), retrieves the results and proceeds with annotating the task graph (655). This 
annotated task graph is then returned to the administrator for further processing (660). 

Finally, the task graph builder verifies whether one or more new RFCs have arrived 
for which the procedure needs to be repeated (665). If this is the case, the algorithm proceeds 
to step (670), retrieves the RFC and transitions (675) to step (605). Otherwise, the algorithm 
ends at block (680). 

Referring now to Figure 7, a flow diagram illustrates steps of actions for the 
construction of a Task Graph from dependency information, according to an embodiment of the 
present invention. The algorithm begins at block (700) and proceeds as follows: Upon receipt 
of a Dependency Graph (705) from the Dependency Service (125), the outermost element of a 
workflow (a process container) is created (710) in which all other elements of a Task Graph 
will be subsequently inserted. The dependency graph contains a list of artifact tuples; the 
overall list of tuples is called candidate list. We assume that every tuple contains first the 
antecedent, and then the dependent artifact, each prefixed with the name of the change 
management operation the administrator (100) has specified in the RFC (105), and having the 
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hostname as suffix. Every element of a tuple can then be referred to as task; the first element 
is the predecessor task, the second one is the successor. An example of a task is "Install Servlet 
Container on host "X"". In addition, every tuple stores a precedence constraint (such as FS, SS, 
SF, FF) that annotates the dependency link. One having skill in the relevant art will recognize 
5 that modifications and extensions to the way the dependency graph is represented may be made 
without departing from the spirit and scope of the invention. First, the algorithm determines if 
the list of candidates is empty (i.e., no tuples are present) (715). 

If the list of candidates contains one or more tuples, the algorithm proceeds to block 
(720) and selects a tuple from the candidate list. No assumptions are being made with respect 

10 to the order of the tuples in the candidate list. In addition, the tuple selection can happen in 
any order, since the candidate list is essentially an unordered bag of tuples. Once a tuple is 
selected, the precedence constraint is read and stored for further processing (725). Then, the 
algorithm determines if the tuple is empty (i.e., no tasks are present) (730). If this is not the 
case, the algorithm selects a task from the tuple. Here, the order in which the tasks are chosen 

1 5 matters, because the first element of a tuple is the predecessor, while the latter task within the 
tuple is the successor. Once a task has been selected (740), the algorithm proceeds with 
extracting the hostname from the task by applying a simple read operation to the task suffix 
(745). Next, the algorithm determines if a sequence (a container that stores tasks in a partial 
order) for the chosen host already exists (750). If this is not the case, the algorithm creates 
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such a host sequence (755) and subsequently inserts the currently selected task in the host 
sequence (760). Otherwise, the algorithm checks if the task already exists in the host sequence 
(775). This is needed to prevent duplicate tasks in a host sequence. If the task is not already 
part of the host sequence, it is inserted (760); otherwise, the algorithm proceeds to block 
(765). Finally, the task is removed from the tuple because it has already been processed (765). 

The algorithm verifies again if the tuple is empty (770) and proceeds to block (780) if 
there is still a task in the tuple. A remaining task is by definition the successor task, as the 
predecessor has already been removed from the tuple in step (765) and placed into the host 
sequence. Consequently, block (780) inserts an outgoing link reference (i.e., a pointer to a 
successor task, referring to the successor by its name) in the task of the host sequence. The 
algorithm proceeds then to block (740) and applies the task procedure (blocks 740 to 765) to 
the remaining successor task, and removes this task from the tuple as well afterwards (765). 
This is needed because the successor task may well refer to a different host for which a host 
sequence may either already exist, or not (cf. block (750)). In addition, a check for task 
duplicates in block (740) needs to be carried out for the successor task as well. After the 
removal of the successor task, the tuple is then empty, and the check in block (770) yields a 
positive result. The algorithm then proceeds to block (785), where the precedence constraint 
that is both embedded in the link reference of the incoming link reference of the successor task 
being contained in the host sequence (potentially from a previous iteration of the algorithm) is 
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compared to the one that is kept in memory (cf. block (725)) for the current tuple instance. 
This is needed to ensure that the precedence constraint specified in the most recendy inserted 
tuple is consistent with a precedence constraint between the same tasks that may have been 
inserted into the host sequence previously. 

If the algorithm determines in block (785) that the newly inserted precedence 
constraint is different from the one already stored, the algorithm exits with an error condition 
in block (790) and subsequently end at block (799). This check needs to be carried out only 
once for the incoming link reference of the successor task, because its precedence constraint is 
by definition identical to the one stored in the outgoing link of the predecessor task. 
Otherwise, the algorithm proceeds to block (795) and inserts an incoming link reference into 
the successor task before continuing at block (735) with the removal of the already processed 
tuple from the candidate list. Then, the algorithm proceeds to block (720) and determines if 
the procedure needs to be repeated for one or more additional tuples contained in the candidate 
list. If, however, no more tuples remain for processing (i.e., the list of candidates is empty), the 
completed task graph is then returned to the invoker (797). The algorithm ends at block (799). 

Referring now to Figure 8, examples of the Task Graph Builder APIs are shown. The 
table includes base APIs that can generate, send and request receipt of partial orders of change 
management tasks for a given service and host name. Those skilled in the art will appreciate 
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that the APIs can use one or more parameters (not shown) to identify characteristics (specified 
in the Functional Description) used by the APIs. Specifically, the 

getTaskGraphForlnstall(parameters) API builds the Task Graph for the INSTALL change 
management operation based on a recursive "Drill-Down", carried out by the Dependency 
5 Service. The getTaskGraphForUpdate(parameters) API builds the Task Graph for the 
UPDATE change management operation by invoking a recursive "Drill-Up" on the 
Dependency Service, i.e., it retrieves all the dependents of a given artifact, i.e., the artifacts in 
the dependency hierarchy that are likely to be affected by an UPDATE change management 
operation. The getTaskGraphForUninstall(parameters) API builds the Task Graph for the 

10 UNINSTALL change management operation. The getTaskGraphForRollback(parameters) 
API builds the Task Graph for the ROLLBACK change management operation, which is the 
opposite operation of UPDATE and restores the previously updated version of an artifact. 
The getTaskGraphForInitialConfigure(parameters) API builds the Task Graph for the 
INTTIALCONFIGURE change management operation, which applies basic configuration 

1 5 settings to an artifact, which are needed to install it in the first place. The 

getTaskGraphForConfigure(parameters) API builds the Task Graph for the CONFIGURE 
change management operation, which applies advanced configuration settings to an artifact so 
that it can be customized. 
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For each of the APIs, an administrator is able to customize the results by indicating 
whether he is interested in retrieving simply the task graph, or the annotated task graph that 
contains the task duration estimates in addition to the task graph. This is done by setting a "no 
duration estimates" flag, an input parameter to the APIs, upon invocation. The annotated task 
5 graph is assumed to be the default. 

It is to be understood that the present invention, in accordance with at least one 
presently preferred embodiment, includes an for determining existing relationship 
descriptions between components of the system; an arrangement for transforming, acquired 
relationships into ordered tasks that are linked by temporal ordering constraints; and an 
10 arrangement for creating an order of changes taking into account task relationship constraints. 
Together, these may be implemented on at least one general-purpose computer running 
suitable software programs. These may also be implemented on at least one Integrated 
Circuit or part of at least one Integrated Circuit. Thus, it is to be understood that the 
invention may be implemented in hardware, software, or a combination of both. 

15 If not otherwise stated herein, it is to be assumed that all patents, patent 

applications, patent publications and other publications (including web-based 
publications) mentioned and cited herein are hereby fully incorporated by reference herein 
as if set forth in their entirety herein. 
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Although illustrative embodiments of the present invention have been described 
with reference to the accompanying drawings, it is to be understood that the invention is 
not limited to those precise embodiments, and that various other changes and 
modifications may be affected therein by one skilled in the art without departing from the 
5 scope or spirit of the invention. 
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